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Abstract 

The paper gives a characterisation of the chain relation of a sofic subshift. Every sofic 
subshift E can be described by a labelled graph G. Factorising G in a suitable way we obtain 
the graph G/k that offers insight into some properties of the original subshift. Using G/ ra we 
describe first the chain relation in E, then characterise chain-transitive sofic subshifts, chain- 
mixing sofic subshifts and finally the attractors of the subshift dynamic system. At the end we 
present (straightforward) algorithms deciding chain-transitivity and chain-mixing properties 
of a sofic subshift and listing all the attractors of the subshift system. 
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1 Introduction 

The properties of subshifts play a key role in symbolic dynamics and in the study of various dynam- 
ical systems (for example, cellular automata). The chain relation is an important characteristic of 
subshifts as well as of dynamical systems in general. 

This paper addresses the question how, given a sofic subshift E (described by a labelled graph), 
one can deduce certain properties of this subshift: what does its chain relation look like, whether 
the subshift is chain-transitive or chain-mixing and what are the attractors of the system (£, a). 
While the answer to this question was previously known in the special case of subshifts of finite 
type (a subshift £ is of finite type when x G £ iff x does not contain any factor from a given finite 
set F of forbidden words.), it could not be directly applied to the more general sofic subshifts. 

The core idea of this paper is to factorise the graph G describing the sofic subshift £. By 
identifying the pairs of vertices whose restricted follower sets have infinite intersections, we obtain 
the linking graph ofG. We can then deduce certain properties of £ from this linking graph. We shall 
formulate an explicit description of the chain relation (Theorem fH)) . obtaining the characterisation 
of chain-transitive sofic subshifts as a corollary ( Corollary [15)) . Then we focus on the attractors of 
(£,cr), characterising them (Theorem ll9p . and use the description of the chain-transitive property 
for the explicit description of the chain- mixing property (Theorem 1161) . Finally, we show that we 
can check all these properties algorithmically. 

The properties discussed have several applications. The chain relation tells us whether we can 
get from xG£toi/££ma certain way. Several theorems connect this relation and the attractors 
of (£,<r). 

The chain-mixing property can be used in the study of cellular automata. In [5] , it is shown 
that each subshift attractor (an attractor that is also a subshift) of a one-dimensional cellular 
automaton must be chain-mixing. Using the algorithms from this paper, it is now possible to 
conclude that many sofic subshifts can never be attractors of cellular automata because they are 
not chain-mixing. Unfortunately, being chain-mixing is not a sufficient condition to be an attractor 
(see examples in [2]). 

2 Preliminaries 

In this section we shall define several key terms from symbolic dynamics. 
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Let A be a finite alphabet. Then A* resp. A z is the set of all finite resp. (two-sided) infinite 
words consisting of letters from A. Given a (finite or infinite) word x denote by i, £ i the i-th 
letter of x. Note that the set A* contains the empty word A. 

We can write nonempty finite words asu = u\U2 ■ ■ ■ u n . Denote the length of u as \u\ and define 
the concatenation operation as uv = u\ ■ ■ ■ u n vi ■ ■ -Vk where n is the length of u and k the length 
of v. We adopt the notation ui^n for the word UkUk+i 

Let u, v be words (v may be finite or (one- or two-sided) infinite, u is finite) in alphabet A. We 
say that u is a factor of v and write u C v iff there exist k, I 6 Z such that u — v^.i] or if u = A. 

The shift map is the mapping a : A z — > A z given by cr(x)j = Xj+i. 

The set A z can be understood as a metric space: For 1/1/6 A z let p(x,y) = 2~ n , where n 
is the minimal nonnegative integer such that x n 7^ y„ or x_„ ^ y_ n . If x = y, set p(x,y) = 0. 
Then p is a metric on A L . The space A z equipped with the topology induced by p is sometimes 
called the Cantor space. Topologically, A z is a product of compact discrete spaces A and it is thus 
compact. 

The set E C A z is called a subshift iff it is topologically closed and strongly invariant under a, 
that is, <t(£) = S. 

Notice that a subshift is a closed subspace of the compact space A z and is thus itself compact. 

A dynamical system (A, F) is a pair consisting of a compact metric space X and a continuous 
map F : X — > X. An example of a dynamical system is the pair (E, <r) where £ C A z is a subshift 
and a is the shift map. 

Let S be a subshift. The language o/E, denoted by £(£), is the set of all factors of words from 
S: £(E) = {v\3x x}. 

Remark. The language £(E) is central (some authors use the term extendable), that is: 

• For every w C i> £ £(£) we have w € £(£). 

• For every w G £(E) there exist nonempty words u,w such that www G -C(E). 

We say that a language is regular if it can be recognised by some finite automaton (a machine 
with finite memory). A subshift whose language is regular is called sofic. 

A labelled graph G over alphabet A is an oriented multidigraph (a graph where we allow multiple 
parallel edges and loops) whose edges are labelled with the letters from alphabet A. More precisely, 
the labelled graph is a quintuple (V(G), E(G), s,t,l) where V(G) is the set of vertices, E(G) the 
set of edges and the mappings s, t : E(G) — > V(G) and I : E(G) — > A assign to each edge e its 
starting and ending vertex and its label, respectively. Both V(G) and E{G) must be finite. 

A subgraph H of a labelled graph G = (V(G), E(G), s, t, I) (over alphabet A) is a labelled 
graph (V{H),E(H),s H ,t H ,l H ) such that V(H) C V(G),E(H) C £(G) and the mappings s/r,ijj 
and Ijy are restrictions of s, i and /, respectively. The subgraph of G induced by the set U is the 
subgraph H of G such that F(-ff) = t/ and the set E(H) contains all the edges e £ -E(G) such 
that s(e),i(e) e F(i?). 

We call a subgraph H of G terminal if there is no edge e G i?(G) starting in v G V(.ff) and 
ending in u G V^(G) \ V(H). Similarly, we define initial subgraph H of G as a subgraph such that 
no edge e G £7(G) starts in v G V(G) \ V(H) and ends in u G V(H). 

A waZA; of length / in a graph G is any sequence t>oeit> 1 • ■ • eivi where Vi G V(G), Ci G E(G) are 
vertices and edges of G and the edge e^+i leads from to v i+1 in G for each i = 0, 1 . . . , I — 1. 
We allow walks of length zero (a single vertex). We also define biinfinite walks as sequences 
. . . v_2e-if_ieoVoeiVie2f2 ■ • ■ such that the edge e^+i leads from Vi to v i+ i in G for each i £ Z. 

A walk of length < I < 00 is closed if «o = u/. 

As any walk of nonzero length is uniquely determined by its sequence of edges, we shall often 
use a shorthand description of walks, writing down just the edges. 

A labelled graph G is called connected iff for all u, v G V(G) there exists a walk from u to v. 
(To be precise, this is the definition of a strongly connected graph. However, we do not consider 
weak connectivity in the article and so we can safely omit this adjective.) We allow walks of length 
zero, so a single vertex graph is considered connected in this paper. 
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A labelled graph G is periodic iff V{G) can be partitioned into n > 1 disjoint sets of vertices 
Vo, V%, . . . , V n -i such that every edge e G -E(G) leads from some n 6 14 to some u E Vk+i (here 
V n — Vo) for a suitable fe. A graph is aperiodic iff it is not periodic. 

Given a graph G, we can partition G into its maximal connected subgraphs K±, K%, . . . , K n . 
These graphs are obviously disjoint and for every vertex v € G there exists i such that v G V(Ki) 
(we allow single- vertex components). Call K\, . . . , K n the components of G. 

The subshift of a labelled graph G is the set £(G) of all a; 6 yl z such that in G there exists a 
biinfinite walk {ej}j e z such that ?(ej) = Xi. It is easy to verify that £(G) is indeed a subshift of 
y4 z . Call the language of £(G) i/ie language of the graph G and denote it by C(G). 

A vertex u of a graph G is called stranded iff it does not have at least one outgoing and at least 
one ingoing edge (a loop counts as both). 

A graph G is called essential iff it does not contain stranded vertices. 

It is easy to see that for an essential G a (finite) word u = uiu 2 ■ ■ ■ Ufc belongs to the language 
C(G) of G iff there exists a walk e±e2 ■ ■ ■ ej, in G such that the edge ej is labelled by the letter itj. 
Call such a walk a presentation of v in G. Because we allow empty walks, every C(G) also contains 
the empty word A. It can be shown that forgetting all stranded vertices of G does not change the 
language C(G), so we can safely limit ourselves to essential graphs. See [SJ p. 37] for details. 

Proposition 1. p. 133] The central language L is a language of a sofic subshift iff it can be 
obtained as the language of some labelled graph. 

For the sake of providing context we shall now briefly discuss transitivity and the mixing 
property. 

We say that the subshift £ is transitive, iff for every e > and every u, v G £ there exists k and 
w E £ such that p(u,w) < e and p(v, <r k (w)) < e. We say that £ is mixing if for every u,v G £ 
and every e > there exists n such that for every k > n there exists w such that p(u, w) < e and 
p(v, a k (w)) < e. 

It can be shown (see [3 p. 80-82, 127-129], note that the authors of [5] use the term "ir- 
reducible" for what we call "transitive") that a sofic subshift £ is transitive iff there exists a 
connected labelled graph whose subshift is £ and it is mixing iff there exists a connected aperiodic 
labelled graph whose subshift is £. 

The chain-transitivity and chain-mixing properties are weaker properties than transitivity and 
mixing, respectively. 

An e-chain (in £) of length n > from x° to x n is a sequence of words x°, x , . . . , x n G £ such 
that p(a(x i ), x l+1 ) < e for alii = 1, 2, . . . , n - 1. 

Let £ be a subshift. We say that £ is chain-transitive iff for every words x, y G £ and every 
e > there exists an e-chain in £ from x to y. We say that £ is chain-mixing if for every x, y G £ 
and e > there exists k G N such that for every n > k there exists an e-chain in £ of length n 
from x to y. 

Let £ be a subshift, x,y G £. We say that x and y (in this order) are in the chain relation, 
writing {x, y) G C if for every e > there exists an e-chain from x to y. 
Obviously, £ is chain-transitive iff C = £ X £. 

As we have a one-to-one correspondence between subshifts and central languages, we may 
call a language chain-transitive or chain-mixing meaning that its subshift has this property. We 
characterise these properties in terms of the language £(£). 

Definition 2. Let L be a language, u, v G L,\u\ — \v\ = m. The chain of length n from u 
to v in L is a word w, \w\ = n + m with prefix u and suffix v (more precisely, W\i j7n ] = u and 
^[|«;|— TO+i,|w|l = v ) such that if z C w, \z\ < m then z E L. 

Note that while e-chains can not have zero length, we do allow chains of length zero. 

Proposition 3. The pair (x, y) is in C iff for every I E N there exists a chain of nonzero length 
from £[_ M] to V[-i,i]. 



3 




Figure 1: The correspondence between e-chains and chains. 



Proof. Let first (x, y) lie in C, and let I 6 N. Consider u = Xr_j,/i , v = yr_j,/i , £ = 2 ' 1 . There exists 
an e-chain x = z°, z 1 , . . . , z" = y. Now consider the letters Oi = -Z/ +1 . As p(cr(z l ), z l+1 ) < 2 - ' -1 , 
it must be ^ z 1 _ 1> j +1 ] = Z[_j jZ+2 ]. 

Let u> = uaoai . . . a n -i- Then the word tu^x,»+2/+i] is equal to « (see Figured]) for every 
i = 0, 1, . . . , n and so all the factors of u> of length 21 + 1 belong to L. Because v = z™_ { , the word 
ua^ai . . . a n —i is a chain from u to v. Notice that the length n > of the original e-chain from x 
to y is equal to the length of the constructed chain from u to v. 

On the other hand, let without loss of generality e = 2~ l+1 for some I and assume that there 
exists a chain of nonzero length from u = aJ[-j j] to k = y\—i n. We shall prove that there exists an 
e-chain from x to y. 

Let w be the chain from u to u of length n > 0. Set z° = x, z" = y. Denote by z l ,i — 
1,2, ...,n — 1, an arbitrary infinite extension of the word i+2 i+i] m ^ (such an extension 

exists because w^ i i+ 2i] (z L, L is central and S is compact) with W[ i+l i+2 i+i] = n- Note that 
x and y are extensions of u and v respectively. We want to check that x = z , z , . . . , z n — y is 
an 2 _/+1 -chain: z^_ l+2 ^ = W[ i+3 i+2 i + i] — z ]^ll +1 i_u an d thus p(cr(z l ), z l+1 ) < 2~ l+1 . Again note 
that the length of the produced e-chain is equal to the length of the chain w. □ 

Corollary 4. Let L = £(£). 

1. The subshift £ is chain-transitive iff for every u,v 6 L, \u\ = \v\ = m there exists a chain of 
nonzero length from u to v 

2. The subshift E is chain-mixing iff for every u,v 6 L, \u\ = \v\ = m exists k € N such that for 
all n > k there exists a chain of length n from u to v. 

Proof. The first claim directly follows from Proposition^ To prove the second claim it is sufficient 
to notice that the length of e-chain and of the corresponding chain construed in the proof of 
Proposition [3] is the same. □ 



3 The Linking Graph 

The purpose of this section is to define the linking graph which provides useful tool for the study 
of the chain relation. 

Definition 5. Let G be a labelled graph and v be a vertex in G. The follower set of v in G is the 
language Fq(v) of all finite words that have presentations in G that start at vertex v. 
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Figure 2: An example of a linking graph. 



If v is a vertex of G belonging to a component K, we shall call the set Fk(v) the restricted 
follower set and denote it by F(v). 

Definition 6. Let G be a graph and v,w E V(G). We say that the vertices v,w are linked iff 
\F(v) n F(w)\ = oo (note that we consider restricted follower sets here) and write v <~ w. Denote 
by « the transitive and reflexive closure of ~. 

Lemma 7. The vertices v and w are linked iff for any positive integer n there exists a word 
z E F(v) (~1 F(w), \z\ = n. 

Proof. First observe that if yz E F(v) then y E F(v) as we can simply forget the ending of the 
presentation of yz to obtain a presentation of y. 

If now \F(v) <lF(w)\ = oo then F(v) CiF(w) must contain arbitrarily large words as A is finite. 
For a given n, consider y E F(v)(~)F(w), \y\ > n. There exist z\, z 2 such that \z\\ = n and z\z 2 = y. 
Thanks to the above remark we have that z\ E F(v) n F(w) and we are done. 

On the other hand, if F(v) (~l F(w) contains word z\ of length 1, z 2 of length 2 and so on, then 
there is an infinite subset {z\,Z2, . . . } of F(v) n F(w) and thus \F(v) D F(w)\ = oo. □ 

Definition 8. Let G be a labelled graph. The graph G/~, called the linking graph of G, is a 
graph obtained from G by joining all pairs of linked vertices. More precisely: The set V(G/~) is 
the set of all equivalence classes of « on V(G). An edge e (labelled by the letter a) goes from a 
vertex x to a vertex y in G/~ iff there exist vertices u E x, v E y such that an edge / labelled by 
a goes from u to v in G. 

In a labelled graph, a word might have multiple presentations. Define the projection 7r : V(G) — > 
V(G/fa) that assigns to every w E V(G) the equivalence class of v in w. We want to show that, for 
v long enough, the images under it of every pair of presentations of v intersect in a nice way. 

In the following, let c be the number of components of G. 

Lemma 9. Let t be a positive integer. Let the (nonempty) graph G consist of c components and let 
k > (t+ l)c 2 + c. Let ei, . . . , efe and fi,. .. ,fk ° e two walks in G. Then there exist two components 
M e , Mf of G and a positive integer r < k — t + 1 such that e i+r E M e and f i+r E Mf for every 

ie{o,i,2,...,t-i}. 

Proof. To prove the lemma, we shall use the pigeonhole principle. Without loss of generality, 
assume k = (t + l)c 2 + c. Since there are c components, at most c — I edges are spent traversing 
between components. Thus there exists an interval e s , e s +i, ■ ■ ■ , e s+c ( t+ i)_i of the walk ei, . . . , 
of length c(t + 1) that passes through only one component M e . 
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Figure 3: The two walks from Lemma[9l The thick arrows represent the interval r, r+1, . . . , r+t—1. 



Consider the walk f s ,f s +i, ■ ■ ■ , f s + c (t+i)-i- This is a walk of length tc + c and thus, using 
the above argument again, there exists an interval f r , / r +i, • • • , /r+t-i going through only one 
component Mf. But then e r+i € E(M e ) and f r+i E E(Mf) for % = 0, 1, . . . , t - 1. □ 

Corollary 10. For every (nonempty) labelled graph G there exists a length I such that if w G £(G) 
is of length at least I and v\e\V2&i ■ ■ ■ R\ w \v\ w \+\ and v^e^v^e^ ■ ■ ■ e [uj| ^'^i+i are ^ wo presentations 
of w in G, then there exists r such that 7r(iv) = ^(v' r ) (recall that tt is the projection from G to 

Proof. If uii 1 W2 £ V(G) are not linked then there exists a finite m Wl , W2 such that \Fx(wi) (~l 
F L (w 2 )\ — m w 1 ,w2- Take m — 1 + mffl{m miiM! : w\ 1 W2 S V(G) not linked}. It is then easy to 
see that any two vertices u, v £ V{G) are linked iff the intersection of their restricted follower sets 
contains a word of length m. 

Take I — (m + l)c 2 + c, and use Lemma[S]for the two presentations of w. We obtain that there 
exists r such that v r and v' r both contain the word W[ r r+m _ 1 ] in their restricted follower sets. Thus 
v r and v' r must be linked and so ir(v r ) — Tr{v' r ). □ 

The components of G/~ are partially ordered by the relation "the component K can be reached 
from the component L". Denote this relation by K > L. 

Definition 11. We call a sequence zq, . . . , z n of vertices of G a generalised walk if there exist edges 
ei, e2, . • • , e„_i G E(G) such that every leads from to some z' i+l « 

Notice that every walk in G/~ corresponds to a generalised walk in G. Informally, a generalised 
walk is a sequence of ordinary walks interleaved by occasional "jumps" to equivalent (under «) 
vertices. Crucial to our proof will be the following lemma describing a way to perform these 
"jumps". 

Lemma 12. Let u ~ u' be vertices of a labelled graph G. Let w be a word whose presentation ends 
in u. Then there exists a word w' whose presentation ends in u' such that \w\ — \w'\ and there 
exists a chain from w to w' . 

Proof. As u w i/, there exists a sequence of linked vertices u = vq ~ v\ ~ • ■ • ~ Vk — u'. Let us 
proceed by induction on k: 

1. If fc = then we are done, as it suffices to take w — w' . 



G 




Figure 4: Getting from w" to w' . 



2. Assume that the claim is true when k < n for some n. Let us have u = Do ~ • • • ~ u n -i ~ 
v n = v! . By the induction hypothesis, there exists a chain q from w to some u>" such that 
one presentation of w" ends in the vertex s = v n —\. 

It is s ~ u', so there exists z € F(s) fl F(u'), \z\ — \w"\. Because z £ F{u'), there exists a 
presentation of z beginning in u' that does not leave the component of u' . Call t the ending 
vertex of this presentation. There exists a walk from t back to u' which presents some word 
r (see Figure |4]). Consider now the sequence w"zr. It is easy to verify that this is a chain 
from w" to w' = (w''zr)n w n zr ii w /ri +1 \ w ri zr n and that w' has a presentation ending in u'. 
But then qzr is a chain from w to u/, ending the proof. 

□ 

Remark. If x is an infinite word and p, q two presentations of x then the images 7r(p) and 7r(g) 
of p and g under the projection 7r both begin in the same component a{x) and end in the same 
component u)(x) of G/~. 

Proof. Let K, L be the components of G/~ where n(p), Tt(q), respectively, end. These components 
are guaranteed to exist because G/~ is finite; they are the maximal (under the ordering <) com- 
ponents visited by ir(p) resp. n(q). There exists n such that the intervals [n, oo) of n(p), n(q) both 
stay in K, L. But due to Corollary [TU] there must exist some v £ P[ nt0 o) an< i v ' G 1[n,ao) such that 
7r(i>) = tt(v') and so K — L. Similar argument holds for the beginnings of p, q. □ 

Remark. Let G be a nonempty labelled graph. Let H be a subgraph of G/~ and let K — tt~ 1 (H). 
Denote by L the subgraph of G induced by the vertices not in K (see Figure [S]). Then there exists 
m such that no word of length at least m may simultaneously have a presentation in K and in L. 

Proof. Let m be equal to the constant from Corollary [TO] This Corollary tells us that if some u 
of length m had a presentation in both K and L then there would exist two vertices u £ K,v £ L 
such that 7r(u) = ir(v). But the whole set ir~ l (u) must lie either in K or in i, a contradiction. □ 

Lemma 13. Let H be a terminal subgraph of G/~, let K,L and m be as in Remark^ Let u be 
a word of length I > 2m + 1 with a presentation in K and let there lead a chain from u to some v. 
Then ur m +i,j] has a presentation in K . 

Proof. Take the chain w of length n from u to v. We prove this lemma by mathematical induction 
on n. 

1. For n = the claim is trivial. 
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Figure 5: Example of the situation in Remark [3] and Lemma ITTT1 



t — l/7[ n _]_ m _j_x — /,n+m] 
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Figure 6: Factors of w and v in detail. 



2. Let all lengths smaller than n satisfy the condition. Observe that Wh n ± m i is a chain from u 
to t = u\ n+m+ i_ l;n+m -\ of length n + m — I < n (because I > m). Thanks to the induction 
hypothesis we know that t\ m +\n has a presentation in K. It is v = u>[„ + i, n+ ;] and so 
v [i,m] — w [n+i,n+m] = t[i-m+i,l] nas a presentation in K (here we use that l — m + 1 > m + 1). 
Denote un, TO ] by z. Because \z\ = to and z has a presentation in K then z must not have a 
presentation in L. But then all presentations of z must end in some vertex of K: Any walk 
in G which enters the terminal subgraph K is not be able to leave K. (Had z a presentation 
beginning and ending in L then the whole presentation would lie in L.) 

Because it is v = zv[ m+ i t q we see that any presentation of v must enter K after at most to 
edges. Taking any presentation of v in G then gives us a presentation of f[ m +i ; ;] in K. 

□ 

Now comes the core theorem of this section that uses all the above results to describe the 
relation C. 

Theorem 14. Let S be a sofic subshift, G a labelled graph, £ = £(G). Let x,y £ £. T/ien 
(x,y) G C iff lu(x) < a(y) or y = o~ n (x) for some n > 0. 

Proof. Obviously, if j/ = <7 n (x) for n > then (:e,j/) G C. Assume now that u)(x) < a(y). Using 
Proposition [3l it suffices to prove that there exists a chain from w = £[-u] to w' = y\-i t i] for any 

leE. 

Let u be the ending vertex of some presentation of w and u' be the starting vertex of some 
presentation of w' . As uj(x) < a{y), there exists a generalised walk u = Zq,Z\, . .., z n -i, z n = u' in 
G. We want to take this walk and turn it into a chain. More precisely, we want to find a chain 
from w to some w" such that one presentation of w" ends in v! . As w"w' is a chain from w" to 
in' , by composing both chains we get a chain from u> to it/ of nonzero length (see Figure [7]). 
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Figure 7: Turning a generalised walk to a chain. 



V[-i,i] 



n 




r4 fr> 










2/[ n] — cr "( a; )[ i+n.J 

71— 2 




;ure 8: Proving that p(a n (x 


),y) <2 



To produce the chain from w to w", use mathematical induction on n: 

1. If n = then the existence of the chain follows directly from Lemma [T2l 

2. Let the theorem hold for n — l. An edge (labelled by some letter a) leads from z n -i to a 
vertex z' « z n . Using the induction hypothesis, there exists a chain from w to some word v 
whose presentation ends in But then we also have the chain va from v to v' = VraiuiiCi. 
The word v' has a presentation that ends in z' and, using Lemma [T^] again, we get that there 
exists a chain from v' to some w/' whose presentation ends in z n = u' . 

In the other direction, let (x,y) 6 C. Using Proposition [3] we get that for every I > there 
exists a chain of nonzero length from air— z,ji to yr-jn. Denote by n\ the minimum (nonzero) length 
of such a chain. Since a chain for I can be easily obtained from a chain for k > I by forgetting the 
prefix and suffix, {n{\^ 1 is a nondecreasing sequence. We shall distinguish two cases: 

1. Let the sequence be bounded and let n = max{n/|/ € N}. We claim that then y = <j n (x). 
Indeed, for all but finitely many values of I we have ni = n and if a chain of length n leads 
from X[_i t q to y\-i ; i] then p(a n (x),y) < 2 n ~ l (see Figure H]). Thus p(cr n (x),y) — and 
y = a n (x). 

2. Let the sequence be unbounded. We want to prove that then a(y) > ui(x). 

Let p be a presentation of x in G and let h be an index such that the walk P[h,ao] belongs to 
only one component M of G. Without loss of generality assume that h > 0. 

Take an arbitrary I > 0. As the sequence is not bounded, there exists k > h + 2l such 

that nk > I + h. The chain from £t?r fe,fe] to y\-k,k] contains as a factor the chain from x^.h+21] 

to £/[-/,;]■ Here we use that k > h + 21 to ensure that x^ ^^i] E ^[-fc^] and nfe > / + h to 
ensure that the length of the chain is positive. See Figure [5] 

For Z sufficiently large, we can use Lemma 1131 with u = x\f l ,h+2i]j v = Vl-IM an d H equal 
to the minimal terminal subgraph of G/~ containing lu(x), obtaining that y[ m -i t i\ has a 
presentation in the graph K — 7r _1 (_ff). Because m is a constant and is compact we 

get that the whole word y belongs to £(-&"). 



9 



-k x [-k,k] k V[-k,k} 









< 


> 












< 


► 

rtfc — h — I 




x [h,h+2l\ 


Vl-Ll] 



Figure 9: The chain from asr^w to y[-k,k] an d the chain from X[ hh+2 i] to y\-in- 



Now it remains to observe that the graph tt(K) = H is precisely the graph of all vertices 
reachable from u)(x). That means that a{y) > u){x) as one presentation of y begins in K. 

□ 

Corollary 15. Let L = C{G) be a central regular language, G an essential labelled graph. Then, 
L is chain-transitive iff G / ~ is connected. 

Proof. Using Theorem 1141 we see that if G/~ is connected then L must be chain-transitive. 

In the other direction, assume by contradiction that if is a terminal component of G/~, L is 
an initial component of G/~ and K ^ L. The preimages 7r~ 1 (ii') resp. 7r _1 (L) must contain at 
least one terminal resp. initial component of G. As G is essential, there must exist x G E(7r _1 (i ; C)) 
and y G S(7r _1 (L)). Were y — a n (x) we would get a contradiction with Remark [3] and so, using 
Theorem [T4l we get K < L, a contradiction. Thus K = L and G is connected. □ 

4 Chain-Mixing Sofic Subshifts 

We have seen that (for G essential) we can translate the question of chain-transitivity to a question 
about the structure of G/~. In this section we characterise the chain-mixing property using the 
structure of G/~. 

Theorem 16. Let L = C{G) be a central regular language, G essential. Then, L is the language 
of a chain-mixing subshift iff G / ~ is connected and aperiodic. 

Proof. First let us prove the necessity of the conditions. Let L be chain-mixing. Then L is also 
chain-transitive and, as follows from Corollary [TSJ G/~ must be connected. 

Let us now assume that G/~ is periodic with a period n > 1. Let w £ C(G) be a word that 
is long enough to satisfy the conditions of Corollary [TO] We claim that then all ending vertices 
of all presentations of w must be projected to the same partition V% of G/~. Let v\V2 ■ ■ - V\ w \ 
and . . ■v\ w \ be two walks presenting w. Then Corollary 1101 tells us that there exists r such 
that v r « v' r . But then 7r(u r ) = ir{v' r ) s Vj for some j and thus 7r(f r +i), 7r(u,'. +1 ) G V}+i and 
■K(vr+2),Tr(v' r+2 ) G V j+ 2 and so on, ending with n(v\ w \), tt(v'^) G V( j+ \ w \- r ) mod «■ 

Take some w such that |w| = I + 1 where I is the constant from Corollary [TO] Let z be a 
chain from w to w of length m. We claim that then n\m. This will be a contradiction to the 
chain- mixing property (via Corollary 2]) . Without loss of generality, let Vb contain all the end 
vertices of all presentations of w. Then z^+z] and Z[i+i,i+z+i] share a factor of length I and so, 
due to the above remark, if presentations of z^ iji+ ^ all end in Vt then presentations of Z[i+i,i+i+i] 
all end in Vu+u moc i „. But then V m mo d n = Vo and so n\m. 

In the opposite direction, assume we have a labelled graph G such that G/~ is connected and 
aperiodic and L — C(G). We want to show that L is chain-mixing. We begin by showing that for 
any word w there exists a k such that we can find chains of any length n > k from w to w. 

Take the least common divisor d of the lengths of all chains from w to w. Notice that all closed 
walks in G must have lengths divisible by d, otherwise we could easily produce chains from w to 
w of lengths not divisible by d (L is chain-transitive). 
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Assume that d > 1. Let v be the ending vertex of one presentation of w. As G/~ is aperiodic 
and connected, there exists a generalised walk in G of some length I not divisible by d from v back 
to v. Using the same algorithm as in the first part of proof of Theorem 1141 we obtain a chain from 
w to w. Each closed walk in G has length divisible by d and in the proof of Theorem [14j we have 
produced the chain from the generalised walk by adding only words presented by closed walks. 
This means that the length of our chain gives the same remainder when divided by d as the length 
I of the corresponding generalised walk. Thus we get a chain from w to w of length not divisible 
by d, a contradiction. 

If d = 1 then there must exist chains from w to w of lengths £i, I2, ■ ■ ■ , lp such that the least 
common divisor of l\, . . . , l p is 1. 

Lemma 17. Let {Zi, . . . l p } be a set of positive integers whose greatest common divisor is 1. Then 
there exists k such that every n > k can be written as n = r\l\ + r-zLi + • ■ • + r p l p where are 
positive integers or zeroes. 

Proof. First notice that for some integers Si it is 1 = sil\ + S2I2 + • • ■ + s p l p because Z is a principal 
ideal domain. Set m = \s\\l\ + \s2\l2 + • • • + |s p |Z p . 

Let k — mli. If now n = q ■ l\ + 1 where t 6 {0, 1, 2, . . . , l\ — 1} and q > m then it is 

n = (q — m)h + mh + t(sih + S2I2 + ■ ■ ■ + Spl p ) 
p 

n = (g - m)li + ^2(tsj + \si\h)lj 
1=1 



Letting r\ = q — m + tsi + /j.|si| and = tSi + h\si\ for i = 2, 3, . . . ,p we obtain > such that 

Using Lemma [T71 we see that for any n > k we can compose the chains from w to w to obtain 
a chain of length n. 

Having found chains from w to w of any length n > k we want to find chains from w to some 
arbitrary z. As L is chain-transitive, there exists a chain from w to z of length k' . By composing 
this chain with a suitable chain from w to w of length n > k we can obtain a chain from w to z of 
any length n' = n + k' > k + k'. As we can do this (with different k, k') for all w, z, the language 
L must be chain-mixing. □ 



5 Attractors of Sofic Subshifts 

In this section we use the linking graph to characterise all the attractors of the dynamical system 
(£,cr) when S is a sofic subshift. There are several slightly different definitions of attractor. We 
shall use the following one (from [3]): 

Definition 18. Denote by d(x,Y) the distance of the point x € X from the set Y C X in X. 
An attractor Y of a dynamical system (X, F) is a nonempty closed subset of X that satisfies the 
following: 

1. F(Y) = Y 

2. Ve > 0, 35 > 0, € X, d(x, Y) < 6 => Vn > 0, d(F n (x),Y) < e 

3. 36 > 0, Vx € X, d(x, Y) <5^ lim d(F n (x),Y) = 0. 

n — >oo 

There are several theorems that show the correspondence between attractors and the chain 
relation. We use the following theorem. 

Theorem 19. p. 82] Let Q be an attractor. Then f2 is chain-invariant, i.e. Vz S f2, (z,y) G 

c^y en. 
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Theorem 20. Let G be an essential labelled graph. Let H be a nonempty terminal subgraph of 
G/~. Then the set Jl = E(7r _1 (i?)) is an attractor of (£(G),er). Moreover, all attractors of 
(£(G),ct) are of this type. 

Proof. We shall first prove that fl is indeed an attractor. Obviously, it is a subshift and so it is 
shift-invariant and closed. As tt~ 1 (H) is nonempty and essential, f2 is also nonempty. 

Because H is a terminal subgraph of G/~, we can utilise Remark [3] and obtain m such that if 

> m then v may not lie both in the language of 7r _1 (iJ) and the language of G \ 7r _1 (iJ). 

Without loss of generality, let e = 2~ fe , k > 0. Let S — 2~ rn ~ k . If d(x, Q) < 5 then x< m _ kt _ k _ii 
does not belong to the language of G \ 7r _1 (iJ). Thus all presentations of a;[- m -fc,-fc-i] en d in 
a vertex of ir~ 1 (H) and cc[_fc i0 o) has a presentation in 7r _1 (i/). Then cr"(:£)[-fc.fc] ~ x [-k+n.k+n] 
belongs to the language £(7r _1 (7J)) and so d(a n (x), tt) < e for all n > 0. 

Similarly letting S = 2~ m yields that if d(x,fl) < 5 then o- n (x)[_ n n ] = X[ 2n ] belongs to 
£(-7r -1 (-ff)) and so d(a n (x) , O) < 2~™ for all n. As n tends to infinity then d(a n (x) , fi) tends to 
zero, concluding the proof that f2 is an attractor. 

Let now fl be an attractor of the subshift system. Let x £ Q and take a presentation p of i 
in G. Let K be the component of G where p starts. Let H be the terminal subgraph of G/^ 
generated by ir(K) (subgraph induced by all vertices that can be reached from ir{K)). We claim 
that E(-!t- 1 (H)] )C ft. 

To prove this claim, we just have to prove that there exists z G ^(K) n 51 and use Theorem 
RBI and the chain-invariance of attractors (Theorem flT)|) . Then E(7r~ 1 (_ff)) is precisely the set of 
words of S that z is in the chain relation with and so S(7r _1 (iJ)) C fl. 

As A z is a compact space and fi is a closed subspace of A z , f2 is a compact set. Thus any 
sequence in f2 has an accumulation point. Because is cr-invariant it contains all the words 
x n = cr~ n {x), n £ N. This sequence has an accumulation point z 6 tt. For any m > there exists 
k > such that X(_ 00 ._ / r i;+m ] has a presentation in K and Z[_ m ,m] = X[-k- m .-k+m] > t nus ^[-m.m] 
has a presentation in for any m. This means that z S S(A"). But then f2 is a union of subshifts 
of the form S(7r _1 (7J 2; )) where H x are terminal subgraphs of G/~ and x e 51. While it is not in 
general true that S(G) U = E(GU H), in this special case we can use to our advantage the 

fact that all H x are terminal and so all walks in H are contained in at least one H x . Thus by 
taking H = 1J H x we get J! = S(7r _1 (_ff)) where if is a terminal subgraph of G/~. □ 

6 Algorithmic checking of properties 

In this section we describe the algorithms that, given an essential labelled graph G, construct the 
graph G I ~ and check whether S(G) is chain-transitive or chain-mixing. As there is presently little 
need for practical implementations of such algorithms, we provide only very broad descriptions. 

The construction of G/~ The proposed algorithm is quite straightforward: It first finds all 
pairs of linked vertices and then joins such pairs of vertices together. We shall use the double 
depth-first search algorithm from [TJ p. 489] that finds all components of a given graph in time 
0(\V\ + \E\). 

Definition 21. Given two labelled graphs G,H we can construct their label product G * H: a 
graph with the vertex set V(G) x V(H) and edge set {(e,/) : e 6 E{G)J E E(H),l(e) = 1(f)}. 
That is, an edge leads from (u, v) to (u' , v') iff the edges from u to u' and from v to v' exist and 
have the same labels. 

Algorithm 6.1. Given G construct G/~. 

1. Find all components C\, . . . , Gfe of G. 

2. Construct the label products Ci * Cj of all pairs of components. 
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3. For each i < j find all components of Ci * Cj that contain an oriented cycle. Paint these 
components red. Let Gij be the subgraph of Ci * Cj induced by the set of vertices 

V(Gij) = {v G V(Ci * Cj)\there exists a walk from v to some red component}. 

4- It is u ~ v iff (u, v) G V(Gij) for a suitable 

5. Join together all pairs of linked vertices in G to obtain G/~. 

To prove the correctness of the algorithm, it suffices to show that (it, v) G V(G»j) iff u ~ v for 

ueVid^veViCj). 

Proof. If u ~ v then there exists a word w of length n = \V(G)\ 2 + 1 with a presentation ei . . . e n in 
Cj starting in it and a presentation . . . in Cj starting in v. Then (ei, e' 1 )(e2, e^) ■ ■ ■ (e„, e^) is a 
walk in C, * Cj. Because Ci * Cj has at most |U(G)| 2 vertices, this walk must return to an already 
visited vertex at least once and that is only possible in a red component. Thus (u,v) G V(Gij). 

On the other hand, if (it, i/) G V{Gtj) then we can find arbitrarily long walks that start at 
(it, v). Let (ei, e'i) . . . (e„, e' n ) be such a walk of length n. Then it is l(ei) = for all i and so 
ei . . . e„ and e[ . . . e' n are two presentations of the same word starting in it and v. We can do this 
for any n and so it ~ v. □ 

Checking for chain transitivity Using the algorithm from [TJ p. 489] again we can easily 
check whether G/~ is connected. 

Checking the chain-mixing property We need to check that G/~ is connected and find the 
period of G/~. This can be done in linear time using breadth-first search as described in [3]. 

Attractors It suffices to write down all nonempty terminal subgraphs of G/~. Since the number 
of such subgraphs may in general be exponential in the number of components of G/~, there can 
be no fast algorithm. A backtracking algorithm can be used here. 

Complexity of the algorithms The first three algorithms run in polynomial time. The creation 
of G/~ demands the most time while the checks of connectivity and aperiodicity both run in time 
0(\V\ + \E\). 

Unfortunately, the number of terminal subgraphs may be exponential to the size of the input 
graph so outputting all the attractors of the shift dynamic system is in general not very practical 
for graphs with many components. 

7 Conclusions 

In this paper, we have introduced and used the notion of linking graph to better understand sofic 
subshifts of A z . It turns out that we can characterise the relation C and all attractors of S using 
the properties of G/~. 

We have proposed straightforward algorithms to decide in polynomial time (to the size of graph 
G describing E) whether a sofic subshift S is chain-mixing or chain-transitive and an algorithm 
that, given an essential graph G, lists (generally not in polynomial time) all the attractors of 
(S(G),a). 

It is interesting that while chain-transitivity can be decided in polynomial time, deciding tran- 
sitivity is co-NP hard, as was recently shown in [6]. The cause of this contrast seems to be that 
when deciding transitivity, we have (explicitely or implicitely) to decide wheteher for H, H graphs 
it is S(i?) C 'S(H'), a hard question, while deciding chain-transitivity requies mereley that we 
decide if T,(H) n S(iJ') ^ 0, an easy question. 

Linking graphs have proven useful in describing the properties of E that depend mainly on C. 
Other properties of linking graphs might be a nice subject for further research. 
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